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Abstract 


1 
2 

3 We developed a modern methodology to retrieve water (H 2 O) and deuterated water 

4 (HDO) in planetary and cometary atmospheres, and constructed an accurate 

5 spectral database that combines theoretical and empirical results. Based on a 

6 greatly expanded set of spectroscopic parameters, we built a full non-resonance 

7 cascade fluorescence model and computed fluorescence efficiencies for H 2 O (500 

8 million lines) and HDO (700 million lines). The new line list was also integrated into 

9 an advanced terrestrial radiative transfer code (LBLRTM) and adapted to the CO 2 

10 rich atmosphere of Mars, for which we adopted the complex Robert-Bonamy 

11 formalism for line shapes. We then retrieved water and D/H in the atmospheres of 

12 Mars, comet C/2007 W1 (Boattini), and Earth by applying the new formalism to 

13 spectra obtained with the high-resolution spectrograph NIRSPEC/Keck II atop 

14 Mauna Kea (Hawaii). The new model accurately describes the complex morphology 

15 of the water bands and greatly increases the accuracy of the retrieved abundances 

16 (and the D/H ratio in water) with respect to previously available models. The new 

17 model provides improved agreement of predicted and measured intensities for 

18 many H 2 O lines already identified in comets, and it identifies several unassigned 

19 cometary emission lines as new emission lines of H 2 O. The improved spectral 

20 accuracy permits retrieval of more accurate rotational temperatures and production 

2 1 rates for cometary water. 
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1. Introduction 


2 Water is among the most searched molecules in the Universe, owing to its role as a 

3 hydrogen repository and its strong connection with life on Earth. Even though it 

4 comprises just 0.02% of Earth's mass, all known forms of life depend on water 

5 through its role in metabolism. Our definition of "habitability” is thus strongly linked 

6 to water abundance, ultimately driving its search across the Universe. 

7 Water has been sought and found in extremely diverse astronomical environments, 

8 from the cold interiors of comets [1] to the hot atmospheres of exoplanets [2] and 

9 the even hotter atmospheres of stars [3], The recent discovery of water in 

10 proto-planetary disks [4], and further isotopic measurements in comets [5], are 

11 used to probe the beginnings of our Solar System. As a strongly polar molecule, 

12 water has a relatively high sublimation temperature when compared to the common 

13 apolar gases (H2, CHU, NH3, H2S, CO2, O2, N2, etc.). This property molded our 

14 Planetary System, separating the formative zone of rocky terrestrial planets 

15 (Mercury, Venus, Earth, Mars) from that of the gas-rich Jovian planets by the frost 

16 line (at ~2.7 AU, [6]) where water ice first becomes stable. 

17 Our capabilities to measure water remotely have expanded greatly with the recent 

18 advent of powerful high-resolution spectrometers atop high altitude mountains and 

19 in space. In concert with this evolution, the detection of high temperature water in 

20 sunspots has driven exponential growth of spectral databases that characterize the 
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1 radiative properties of water, now reaching more than 1,200 million lines for its 

2 isotopologues (see Tables 1 and 2). 

3 Remote sensing of water has not been restricted to abundance measurements, but 

4 includes the characterization of its spin-isomeric (ortho, para) and isotopic forms 

5 (H 2 O, HDO) that trace the environment in which the water molecules formed. The 

6 nuclear spin temperature (derived from the ortho/para abundance ratio of water, 

7 hereafter OPR) may be sensitive to formation temperatures lower than 50K; above 

8 this value the spin-isomer populations are described by statistical equilibrium 

9 (OPR = 3, see Fig. 1 of [7]). Moreover, models of nebular gas-phase chemistry (e.g., 

10 [8]) predict important enrichments of deuterated water at low temperatures 

11 (T<80K). Since both ratios (nuclear spin species, and isotopologues) are very 

12 sensitive to the temperature at which the molecules initially formed, they are now 

13 being used to better understand the primordial conditions of our Planetary System. 

14 Water is (by far) the most abundant primary volatile in cometary nuclei, and the 

15 abundances of other species are expressed relative to it (C2H6/H2O, CH3OH/H2O, 

16 HCN/H 2 O, etc.). These "mixing ratios" with respect to water are the principal 

17 metrics for the currently developed taxonomic classification of comets based on 

18 primary volatile composition (i.e., H 2 O is the “baseline" for the taxonomy, and 

19 therefore its accurate modeling is critical). 

20 
21 
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1 1.1. Detection of cometary water by fluorescent emissions. 

2 Historically, the high opacities at the core of telluric water lines prevented 

3 measurements of water and its isotopologues from ground-based observatories. 

4 However, atmospheric transmittance improves rapidly with increasing altitude 

5 owing to steadily decreasing water abundance. This fact drove the development of 

6 high altitude observatories and ultimately airborne platforms (e.g., the Kuiper 

7 Airborne Observatory, KAO), that allowed the exploration of new spectral regions 

8 and enabled many unique discoveries. A prime example is the detection and 

9 characterization of water vapor in Halley's comet [1,9,10] using the high-resolution 

10 Fourier Transform Spectrometer [X/AX ~ 10 5 ) onboard the KAO. This pioneering 

11 work not only led to the first measurement of water vapor and its nuclear spin 

12 species in a comet, but it also revealed the complexities in the excitation of cometary 

13 water. 

14 Detailed models suggested that fluorescence driven by direct solar pumping is the 

15 main mechanism responsible for emission by cometary water at infrared 

16 wavelengths [11,12], This emission process enables the search for water at infrared 

17 wavelengths using ground-based observatories. As vibrationally excited water 

18 molecules cascade downward, they emit photons in fundamental bands and also in 

19 "hot-bands” at frequencies where telluric opacities are reduced. Although photons 

20 in the emitted fundamental bands are absorbed by atmospheric water in its ground 

21 vibrational state, the emitted hot-band lines terminate on higher vibrational levels 
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1 with much smaller populations at atmospheric temperatures, and thus they permit 

2 measurements of water in remote objects from ground-based observatories. 

3 Spectral lines of water detected in 1 P/Halley with KAO included many predicted 

4 lines of the targeted V3 fundamental band near 2.7 pm, but three lines of a hot-band 

5 (011-010, or V2 + V3 - V2) were detected unexpectedly [9], Solar pumping of other 

6 hot-bands (100-010; 001-010) was later modeled by [13]. The decision to de- 

7 commission the KAO in 1995 created an urgent need for alternative means of water 

8 detection from the ground, stimulating the development of a general hot-band 

9 method for detecting cometary water. Its first successful application was to the V2 + 

10 V3 hot-band near 2pm (vi + V2 + V3 - vi, or 111-100) in comets C/1991 T2 

11 (Shoemaker-Levy) and 6P/d’Arrest [14], and then in C/1996 B2 Hyakutake [15]. 

12 Three additional hot-bands (100-010, 001-010, and 021-010) were identified later 

13 in C/1995 01 Hale-Bopp [16] and in C/1996 B2 Hyakutake [17], and six more hot- 

14 bands were detected in C/1996 HI (Lee) [18,19], 

15 Today, we measure many such lines simultaneously in a network of ten (or more) 

16 vibrational hot-bands, and we perform an intensity analysis that characterizes the 

17 rotational temperature for the emitting water population. These developments, in 

18 concert with advances in infrared detectors, have allowed measurements of water 

19 beyond Earth’s atmosphere with unprecedented sensitivity and accuracy. 

20 
21 
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1 1.2. Recent model developments and applications to water in comets and on Mars. 

2 Although simple in concept, the retrieval of water abundance from these 

3 measurements is far from trivial when considering non-resonant (i.e., "hot-band") 

4 fluorescent emission. Computation of line-by-line fluorescence efficiencies 

5 (g-factors) entails construction of a full quantum mechanical model for the 

6 molecule. This requires a complete characterization of the rotational structure 

7 (energy levels) for all vibrational levels involved (both high-energy levels pumped 

8 by sunlight, and lower levels involved in the subsequent cascade), along with 

9 statistical weights, selection rules, perturbations (e.g., Coriolis effects, splittings, 

10 tunneling) and band emission rates. Not only is this task extremely complex, but 

11 also information for most hot-bands is not available in community spectral 

12 databases (e.g., HITRAN, GE1SA). The main driver for these community databases is 

13 the precise characterization of our own atmosphere (200-300K), so they often omit 

14 information about hot-bands, which are only populated significantly at high 

15 temperatures (> 1000K). Consequently, models of hot-band fluorescence (e.g., in the 

16 011-010, or 100-010 bands) relied on the harmonic oscillator approximation to 

17 estimate the radiative properties of these bands (e.g., [13]). Until now, this 

18 assumption limited the ultimate accuracy of retrieved rotational temperatures, 

19 column densities, and production rates for water in comets. 

20 Recent developments in molecular variational techniques for solving the nuclear 

21 motion problem have revolutionized the field of molecular spectroscopy. These 

22 methods are capable of generating accurate spectral parameters for millions of lines 
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1 when considering a precise potential energy surface (PES). Some successful 

2 examples include: H 2 O [20], HDO [21], NH 3 [22,23], HCN and HNC [24], Recently, 

3 Rothman et al. [25] combined contemporary spectroscopic data with ab initio 

4 theoretical information for certain molecules, leading to a very complete spectral 

5 database for H 2 O, CO 2 , CO, NO and OH (HITEMP2010, more details for the H 2 O 

6 component are provided in Section 2.1). 

7 Here, we present a full non-resonant fluorescent cascade model [with realistic solar 

8 pumping) for H 2 O [considering 500 million lines) and HDO [700 million lines), along 

9 with line-by-line computed fluorescence efficiencies for both isotopologues. The 

10 new model utilizes an updated set of optimized spectral parameters that draws 

11 upon newly obtained experimental and theoretical data [we also corrected several 

12 inconsistencies found in existing databases). We integrated the new line list into an 

13 advanced radiative transfer model [LBLRTM, used for computing atmosphere 

14 transmittances and radiances). We then synthesized spectra for Earth and Mars for 

15 comparison with spectra recorded with the high-resolution spectrograph at Keck II 

16 (NIRSPEC) atop Mauna Kea, Hawaii. We show selected examples that demonstrate 

17 the excellent agreement attained between measurement and model. 

18 As a final validation, we compare measured and modeled fluorescent intensities for 

19 water in comets. The new model reproduces the measured intensities of emission 

20 lines already identified as H 2 O in the cometary spectrum, and it also correctly 

21 identifies several additional lines that were previously unassigned. The confidence 
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1 limits for retrieved rotational temperatures, column densities, and production rates 

2 for water are greatly improved by use of the new model. 

3 2. Spectral databases for water (H 2 O and HDO) 

4 2.1 Available compilations 

5 In the early sixties, Gates et al. [26] published a compilation of line parameters and 

6 computed spectra for water vapor bands at 2.7 pm. This initiated an era of shared 

7 spectroscopic databases that now contain more than 50 molecules and over a 

8 hundred isotopologues. In 1973, Garing & McClatchey reported the first compilation 

9 of multiple molecules (~100,000 lines, [27]) and in 1983 Rothman et al. [28] 

10 reported a molecular database containing ~181,000 lines, which set the foundations 

11 of the now widely used HITRAN database (~3 million lines, [29]). A complementary 

12 effort was started in 1976 at Laboratoire de Meteorologie Dynamique (LMD) in 

13 France [30], resulting in the GEISA database, now containing spectral parameters 

14 for 50 molecules [111 isotopologues) with almost 4 million lines [31], Fed by results 

15 from numerous laboratory spectroscopists, these compilations have allowed a 

16 continuing revolution in remote sensing of planetary atmospheres. 

17 Extrapolation of these line parameters to other environments has not always been 

18 straightforward, in particular because the reported broadening coefficients are 

19 given for a nitrogen-rich atmosphere and the line completeness is normally 

20 restricted to bands that are strong enough at telluric temperatures [< 400K). New 
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1 laboratory experiments are now exploring other collision partners such as COz (e.g., 

2 [32]), the main atmospheric constituent of Mars and Venus. 

3 Enhancement of these databases to make them applicable at higher temperatures 

4 (>1000K) is extremely complex. At higher temperatures many more energy levels 

5 are populated, exponentially increasing the complexity of the measured laboratory 

6 spectra and ultimately constraining the ability of spectroscopists to extract 

7 line-by-line parameters. The solution to this problem emerged from theoretical 

8 studies, where the complete molecular motions are modeled and spectral line 

9 parameters are synthesized. The first steps for these studies were challenging, 

10 leading to spectral parameters of moderate accuracy and limited temperature and 

11 dynamic range (e.g., [33,34]). With the advent of powerful computers and better 

12 characterized potential-energy surface (PES) and dipole moment surface (DMS) 

13 descriptions, these theoretical studies can now synthesize extremely accurate 

14 spectral parameters for millions of spectral lines (e.g., [20]). The recent high-energy 

15 compilation of water in HITEMP [25], provides a hybrid approach in which the 

16 precision of the energy description of the BT2 database is further refined by 

17 integrating semi-empirical information present in HITRAN and in other high- 

18 temperature linelists (e.g., [35,36]). 

19 As presented in Figure 1, terrestrial databases (HITRAN and GE1SA) generally 

20 provide a complete description of the main spectral lines for H 2 O at 296K, even 

21 though they contain only a small subset of the lines contained in BT2 (~eighty per 

22 million). We refer to HITRAN and GE1SA as terrestrial databases, since they are 
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1 mainly intended to synthesize terrestrial spectra [temperatures near 296K and for a 

2 N 2 rich atmosphere). For HDO, both terrestrial databases contain practically the 

3 same spectral information, but they are relatively incomplete when compared to the 

4 ab initio VTT database [21]. Recently, a reliable list of energies for HDO became 

5 available [37] which combines information from 76 sources, and was done as a 

6 collaborative effort to provide reliable spectroscopic parameters for different 

7 isotopologues of water by the International Union of Pure and Applied Chemistry 

8 (IUPAC). We use this comprehensive survey (HDO-IUPAC) to correct the ab initio 

9 parameters (see Section 2.2). 

10 The great value of the ab initio databases is nicely revealed in Figs. 1 and 2, with the 

11 lower panel of Fig. 1 showing densities of up to ~1, 000, 000 lines per 10 cm' 1 for ab 

12 initio databases (in comparison to ~100 per 10 cm- 1 of terrestrial atlases) and Fig. 2 

13 showing that the ab initio databases have almost a complete characterization of the 

14 high energies (up to 25,000-30,000 cm 1 for both isotopologues). Not surprisingly, 

15 the biggest limitations of terrestrial databases appear at higher temperatures where 

16 high energy levels become populated. Terrestrial databases are severely limited for 

17 lines associated with energies higher than 4000 cm' 1 (Figure 2). This limitation is 

18 particularly problematic when synthesizing spectra for environments with 

19 temperatures higher than 1000K, or when computing radiation fields involving 

20 non-LTE excitation of high-lying levels and subsequent cascade (e.g., cometary 

21 fluorescence). Consequently, synthesis of spectra in these regimes requires a new 

22 compilation and validation of information contained in terrestrial and high-energy 

23 databases. 
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1 2-2 An improved compilation of Einstein coefficients and ro-vibrational energies 

2 Most of the spectral parameters contained in the terrestrial databases have been 

3 obtained using extremely high spectral resolutions, and consequently the reported 

4 energies and frequencies in these lists are normally of high precision. In contrast, 

5 such precision with ab initio methods is particularly difficult, owing to micro 

6 perturbations in the molecular motion not entirely described by the available 

7 potential-energy surfaces (PES). The big advantage of theoretical lists arises from 

8 the fact that all lines are computed from a single self-consistent solution, while 

9 terrestrial databases are in general a collection of values obtained from diverse 

10 experiments employing different calibration techniques which consequently give 

11 rise to inconsistencies in these databases. In general, terrestrial databases suffer 

12 mainly from inconsistencies in the reported intensities (see section 2.1 (H 2 O) in 

13 [29]], while ab initio databases are impacted by imprecise frequencies and energies. 

14 We also encountered certain discrepancies in the reported symmetries and line 

15 identifications for both types of databases (see Tables 1 and 2). We tested the 

16 accuracy of reported symmetries (ortho and para, for water] by comparing the 

17 reported statistical weights to values computed from the reported ro-vibrational 

18 quanta. Ortho (para] levels correspond to an even (odd) sum of the quantum 

19 numbers K a , K c , and n 3 (the number of V 3 quanta). The ro-vibrational identifications 

20 (ID) were verified by comparing the tabulated total energy of the levels 

21 (E=Erot+Evib+E e iec) with the corresponding vibrational energies (E V ib; the vibrational 

22 energies were identified by searching the levels with J=0). Lines accessing levels 


12 






1 with energies lower than the corresponding vibrational energy (i.e., negative 

2 rotational energies) were flagged as having incorrect quanta, while lines having 

3 incomplete upper (or lower) level IDs were flagged as having unknown ID. Most 

4 inconsistencies were found in the theoretical databases (see Tables 1 & 2). 

5 The labeling of levels resulting from the ab initio solutions is particularly 

6 challenging since the theoretical models provide data only for ] and the symmetry 

7 block of each energy level [20]. Modelers employ complex identification schemes, 

8 and more levels can be identified (depending on how specifically these algorithms 

9 are defined) but with a higher probability for error. For instance the BT2 line list 

10 (with a conservative labeling scheme) contains only modest errors but leaves more 

11 than 97% of the lines unidentified, while the more aggressive algorithm considered 

12 in the VTT database leaves only 20% of lines unidentified but with an 11% rate of 

13 error. 

14 Line IDs are particularly important for non-LTE investigations (e.g., cometary 

15 fluorescence) when accounting for rotational/vibrational contributions in cascade, 

16 and when comparing and correcting synthetic line parameters with spectral 

17 information obtained in the laboratory. Errors in the symmetry labeling are 

18 particularly important for our measurements of water spin temperatures in comets, 

19 since the symmetry information is necessary for retrieving the ortho-to-para ratio 

20 (OPR). Fortunately, we found relatively few inconsistencies in the BT2 database for 

21 this parameter (see Table 1). 
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1 Combining information from multiple databases is not trivial. A clear set of rules is 

2 needed to determine which lines and spectral parameters from each source should 

3 be included in the new compilation (e.g., see the discussion of H1TEMP, [25]). Our 

4 compilation and correction approach emphasized the self-consistency of the 

5 spectral parameters. In particular, fluorescence branching ratios must be computed 

6 correctly for use in non-LTE fluorescence models, in turn requiring that line 

7 intensities and Einstein 'A' coefficients be consistent among the different bands. 

8 Consequently, we adopted the Einstein coefficients of the self-consistent BT2 

9 database for our calculations, but corrected the BT2 energies/frequencies with 

10 information contained in the terrestrial databases (HITRAN, GEISA) and in the Semi- 

11 Empirical Line Positions (SELP, [35,36]) atlas at the core the H 2 O-HITEMP atlas. The 

12 BT2 database is composed of two components: 1) the energy table containing the 

13 energies of all ro-vibrational levels, and 2) a spectral table containing the 

14 Einstein-coefficients for transitions between the levels established in the energy 

15 table. On the other hand, the GEISA, HITRAN and HITEMP databases report line 

16 frequencies but lack information on the global ro-vibrational structure, and 

17 therefore different lines accessing the same level may indicate different energies for 

18 that level. 

19 We tested the consistency of the energy information contained in the terrestrial 

20 databases by computing statistics (standard deviation and weighted mean) on the 

21 reported energies of lower (E] 0W ) and upper (E up = v + Ei ow ) quantum states for all 

22 lines. For ro-vibrational states that appear in multiple terrestrial databases, the 

23 listed level energies of most (94-98%) are consistent within 0.01 cm 1 or better. 
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However, the rate decreases to 86% when we require a consistency limit better than 

2 0.001 cm' 1 (see Tables 1 and 2). By selecting the levels (from the terrestrial 

3 databases) having a consistency better than 0.3 cm' 1 , we corrected the energy tables 

4 of BT2 and VTT for all matching ro-vibrational levels. 

5 Because of the problems in the labeling of levels in BT2 and VTT, we define a 

6 "match" between a terrestrial and a theoretical level only when both IDs have the 

7 same 6 quantum numbers (m, r\ 2 , n 3 , J, K a , K c ) and the difference between the 

8 empirical and theoretical energies is less than 0.3 cm 1 (to avoid correcting 

9 mislabeled levels). The quantum numbers n x indicate the number of quanta of the v x 

10 vibrational mode (e.g., 3v3 corresponds to n3=3). We have found differences in the 

11 energies reported in HITRAN, GE1SA and in SELP. The energies in SELP and HITRAN 

12 are very consistent, but we have found more significant differences between 

13 HITRAN/SELP and GEISA. Considering that SELP is more extensive (by almost a 

14 factor of two with respect to the terrestrial databases), we consider mainly the SELP 

15 energy information for our compilation. Based on the SELP/HITRAN compilation of 

16 energies, we corrected the majority of the low energy levels (<10,000 cm 1 ) in the 

17 BT2 database, and obtained excellent agreement between the frequencies of our 

18 fluorescence model and cometary data (see discussion in Section 5.1 and Fig. 6). For 

19 HDO, we correct the VTT energies using the comprehensive IUPAC survey [37]. 

20 Histograms of the corrections (theoretical - empirical) are presented in Figure 2. 

21 Unlike H1TEMP, which replaces ab initio frequencies by empirically-derived ones in 

22 those cases where both upper and lower states are known empirically, we have 
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1 additionally substituted experimentally-derived energy levels for the ab initio ones 

2 in cases where the energy of only one state (almost always the lower] is known 

3 empirically. Our approach is based on two premises: empirically-derived energy 

4 levels are in general more accurate than computed ones, and systematics are not a 

5 significant source of error in the BT2/VTT line lists. The first of these assumptions 

6 would appear to be reasonable, having regard to the relative accuracies that are 

7 claimed for the experimental and theoretical data. The second premise is supported 

8 by Fig. 2, in which the differences between the ab initio and empirical energies 

9 appear to be quasi-random, indicating that systematics are unlikely to be a major 

10 source of error in the computed values. 

11 It is easily demonstrated that the practice of substituting for only the lower (and 

12 hence the more accurate of the two levels] will on average remove exactly as much 

13 error as it introduces, reducing it in cases where A and B are in error in different 

14 directions, and increasing it in those cases where the errors on A and B are in the 

15 same direction. Overall improvements in accuracy are achieved by substituting 

16 empirical data for the less accurate (normally the upper state] ab initio value, but in 

17 practice this seldom happens. The main advantage that results when only the lower 

18 level of a pair of ab initio levels is replaced by an empirically-derived energy is not 

19 that it leads to a reduction in the sum of the absolute value of all the errors, but 

20 rather that it narrows the spread of the errors, that is to say it reduces the root 

21 mean square of the errors of all the transitions in the set and leads to a more self- 

22 consistent database, which is important when matching to experiment. The 
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1 approach has enabled us to achieve excellent agreement between observation and 

2 our fluorescence model in the frequency ranges that we have examined. 

3 3. Modeling of water in cometary atmospheres (non-LTE case) 

4 Water is the most abundant volatile in cometary atmospheres. It radiates at infrared 

5 wavelengths via non-resonance fluorescence excited by solar radiation, a full 

6 non-LTE process. Collision partners in cometary atmospheres usually lack sufficient 

7 energy to excite vibrational transitions and the rate of quenching collisions is much 

8 smaller than the radiative decay rates for (infrared active) excited states. Thus, the 

9 vibrational manifold is not populated in LTE (local thermodynamic equilibrium). 

10 Instead, solar radiation pumps the molecules into an excited vibrational state, which 

11 then de-excites by rapid radiative decay (all three vibrational modes of water are 

12 infrared active). Infrared photons are emitted through decay to the ground 

13 vibrational state, either directly (resonant fluorescence) or through branching into 

14 intermediate vibrational levels (non-resonant fluorescence). 

15 For the computation of non-resonance fluorescence, the ro-vibrational structure of 

16 the molecule must be very well characterized up to very high energies. With an 

17 effective blackbody temperature of 5778K the Sun emits radiation over a wide range 

18 of frequencies (see Figure 1), pumping the water molecule into highly excited states. 

19 In the case of a cometary atmosphere at 100K, 99% of the Solar pumping occurs via 

20 transitions with energies (E up ) lower than 7500 crm 1 , and thus the spectral 

21 databases used for computing fluorescence rates should be fully complete up to this 

22 energy limit. However, this requirement is not satisfied for terrestrial databases. 
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1 In the hypothetical case of a 296K pumping source, this requirement is greatly 

2 reduced to E up < 1100 cm 1 ; levels and transitions for this regime are very well 

3 described in HITRAN and GEISA. Before the advent of BT2/VTT and other 

4 high-energy spectral databases, fluorescence modelers had assumed that different 

5 modes of vibration are not coupled and they ultimately relied on the harmonic 

6 oscillator approximation to estimate the strength of the hot-bands from cold-bands 

7 (e.g., [13]). This approximation leads to imprecise branching ratios, fluorescence 

8 rates and line positions. Nonetheless, it allowed investigators to overcome the 

9 limitations of existing spectral databases for two decades, obtaining reasonable 

10 results for this complex problem (e.g., [1,13], and references therein). 

11 In the detailed work by Dello Russo et al. [38], the modeling of H 2 O fluorescence was 

12 advanced by adding rotational branching ratios for four bands (200-100, 200-001, 

13 101-100, 101-001) obtained from the BT2 database, while all the vibrational 

14 branching ratios (with the exception of 200 level) were computed considering the 

15 Born-Oppenheimer approximation and the latest available spectroscopic 

16 information at the time. Because of the limitations of the considered spectral 

17 databases and in the assumed solar field (see next paragraph), this previous model 

18 did not fully describe the complexity of the water spectrum (Fig. 5), although it 

19 allowed retrieval of rotational temperatures and ortho-para-ratios (OPR) from 

20 high-resolution spectra with practical accuracy. 

21 An important recent development has been the very accurate measurement of the 

22 solar radiation field. Most current cometary fluorescence models often assume the 
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1 source for solar pumping is simply a blackbody continuum with the effective 

2 temperature of the Sun. This approximation is to some extent correct for the 

3 continuum flux at certain wavelengths (2900 - 3300 cm 1 }, but it introduces 

4 important inaccuracies beyond this spectral range (see Figure 5}. In addition, the 

5 existence of Fraunhofer lines in the impinging solar radiation leads to changes in the 

6 fluorescence pumps as a function of the heliocentric velocity of the comet (the 

7 Swings effect). Omitting this effect introduces not simply a relative error, but can 

8 lead to biased retrievals of rotational and spin temperatures, since these are derived 

9 from line-by-line intensity ratios. For the pumping radiation field, we developed a 

10 synthesis of the solar spectrum using a combination of empirical parameters from 

11 the solar spectrum [39,40] calibrated with a stellar continuum flux model [41]; see 

12 Appendix B of [42], 

13 Fluorescence emission rates were computed following a four-step process: 1) we 

14 updated the energy tables of BT2 and VTT using values retrieved from the 

15 SELP/IUPAC databases (see Tables 1 and 2); 2) we synthesized a high-spectral 

16 resolution Solar spectrum at the defined cometary heliocentric velocity; 3} we 

17 calculated Solar fluorescence pumps for 500 million lines for H 2 O (BT2) and 700 

18 million lines for HDO (VTT); 4) fluorescence emission rates (g-factors) were 

19 computed for 1200 million lines (BT2+VTT) considering the appropriate branching 

20 ratios for each ro-vibrational level (see details of our General Fluorescence Model 

21 (GFM) in [42]). Vibrational pumps and cascades are presented in Figures 3 and 4, 

22 while the principal factors involved in computing line-by-line and level-by-level 

23 g-factors are presented in Figure 5. 
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1 These calculations are extremely intense, owing to the complexity of the 

2 computation and the enormous amount of data involved. To increase the 

3 computational speed, we parallelized our fluorescence model to run on 

4 multi-processor computers, ultimately leading to substantial increases in 

5 computational efficiencies. We also explored a reduction in the spectral databases 

6 considered. As presented in the cascade figures, pumps and cascades in comets with 

7 upper state energies higher than 12,000 cm 1 are negligible, since these have band 

8 rates lower than 10 7 s 1 . Introducing this cutoff in energy greatly reduces the size of 

9 the database for our calculations, by a factor of ~380 for H 2 O (1.3 million lines) and 

10 a factor of ~90 for HDO (~8 million lines). We validated this approach by comparing 

11 results based on the complete and reduced databases and observed no significant 

12 differences in predicted g-factors. These optimizations allow us to compute a 

13 complete set of cometary g-factors in a matter of hours, instead of days/weeks. As 

14 shown in Figure 5, the new model agrees very well with high-resolution data of 

15 comet Boattini obtained with NIRSPEC at Keck II (see detailed discussion in Section 

16 5.1). 

17 4. Modeling of water in planetary atmospheres (LTE case) 

18 When synthesizing spectra of water (H 2 O and HDO) on Earth and Mars (at 200-300 

19 K), HITRAN and GEISA generally contain most of the strongest lines at infrared 

20 wavelengths. As shown in Figures 3 and 4, HITRAN and GEISA are mostly complete 

21 at 296 K for H 2 O, although some spectral regions contain no empirical information 

22 for HDO. Unlike non-LTE problems, when synthesizing LTE spectra we are mainly 
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1 concerned about the ''local” consistency and precision of the parameters within a 

2 selected spectral range. For instance, missing or inconsistent intensities at a 

3 frequency of 5,000 cm 1 (i.e., at 2 pm in wavelength) do not affect our LTE modeling 

4 at 3000 cm 1 (wavelength, 3.33 pm), in contrast to cometary fluorescence emissions 

5 originating from cascades of transitions associated with diverse energies and 

6 frequencies. Consequently, we use the ''globally" consistent BT2 intensities (see 

7 previous section) for our computations of fluorescence efficiencies, while for LTE 

8 modeling we prefer the highly precise "local" spectral intensities and frequencies 

9 contained in HITRAN and GE1SA. 

10 For high temperature LTE problems (e.g., exoplanets), HITEMP is the suitable choice 

1 1 since it combines the local precision of HITRAN and other semi-empirical databases 

12 with the comprehensiveness of BT2 for high temperature computations. However, 

13 HITEMP does not yet contain the VTT information for HDO (see Figure 1). Rothman 

14 et al. [25] have also assigned line broadening coefficients to the BT2 lines in HITEMP 

15 by extrapolating the procedure in [43] to high rotation-vibration levels. We tested 

16 both types of compilations (HITRAN and HITEMP) in our radiative transfer 

17 modeling of H 2 O on Earth and Mars, and found no significant differences between 

18 them. This is expected, as HITEMP adds high-energy spectral lines of H 2 O that 

19 become significant only at temperatures much higher than those present in the 

20 atmospheres of Earth and Mars. Consequently, for the LTE radiative transfer 

21 modeling of these atmospheres we will restrict our discussion to the HITRAN 

22 spectral database. 
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1 4.1 Telluric transmittances and radiances 

2 An accurate spectral database can serve numerous LTE purposes, such as the 

3 synthesis of transmittances and radiances of planetary atmospheres by integrating 

4 these lists into advanced radiative transfer models. For instance, we compute 

5 terrestrial transmittances to assist in the analysis of spectroscopic data collected 

6 using ground-based observations (e.g., taken using NASA-1RTF, Keck II, VLT), and in 

7 particular, we spectrally calibrate our infrared data by matching the synthetic 

8 spectra to the observed telluric spectral radiance. 

9 To that end, we tested the performance of three radiative transfer models (LBLRTM, 

10 GENLN2 and SSP). The Spectrum Synthesis Program (SSP, [44]) is a robust model, 

11 although with limitations when synthesizing spectra at extremely high spectral 

12 resolutions. The latest version (v4) of GENLN2 [45] provides highly realistic and 

13 Doppler-limited spectral synthesis of the terrestrial atmosphere, but we 

14 encountered problems in the calculation of spectral line shapes since the model 

15 incorrectly accounts for pressure spectral-shifts [46]. In addition, GENLN2 is no 

16 longer supported by NCAR (National Center for Atmospheric Research). For these 

17 reasons we adopted the new efficient Line-By-Line Radiative Transfer Model 

18 (LBLRTM, [47]) maintained by AER (Atmospheric and Environmental Research). 

19 We tested the LBLRTM model extensively, and obtained excellent results (e.g., [42]). 

20 For the terrestrial layering scheme, we consider the standard tropical profile, 

21 modified to describe the local observatory conditions through two temperature 

22 parameters (T1 and T2), a pressure-scaling factor (PF) and abundance factors (AF). 
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The given pressure profile is scaled following P'(z) = P(z) x PF, while the 

2 temperature profile is divided into tropospheric (affected by Tl) and stratospheric 

3 (affected by T2). The abundance profiles are scaled by a molecular multiplier 

4 relative to the tropical value [48]. These parameters are fitted for each dataset 

5 (typically of 10 minutes) to accommodate for variations in airmass and in properties 

6 of the atmosphere (see details in [42]). We have verified that these parameters are 

7 consistent (within uncertainties) with local meteorological data (pressure, 

8 temperature). 

9 One limitation of LBLRTM, and other radiative transfer models, is that the vertical 

10 profiles of isotopologues cannot be parameterized. This is particularly problematic 

1 1 for the heavier isotopes of H 2 O (in particular for HDO) since these have lower vapor 

12 pressures than standard H2 16 0. This difference in the phase curve of the isotopes 

13 introduces important divergences in fractionation, causing the heavier water 

14 isotopes to preferentially condense and precipitate ultimately leading to their 

15 relative depletion with altitude. 

16 The deuterium abundance is normally quantified relative to the reference value 

17 (D/H = 1.5576 x lO 4 , VSMOW Vienna Standard Mean Ocean Water) in parts per mil 

18 (per thousand, %o), and consequently (for example) a parcel with 40% of its HDO 

19 removed would be described with 6D = -400 per mil. The fractionation in the 

20 troposphere is also strongly dependent on atmospheric dynamics (e.g., see 

21 convective/subsiding results in [49], and formation of clouds and atmospheric 
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1 microphysics in [50]), resulting in highly variable deuterium enrichments with 

2 respect to altitude, time and position on the planet (e.g., [51]). 

3 In the troposphere, the ratio of HDO to H 2 O is controlled mainly by Rayleigh 

4 fractionation, which explains the rapid decrease of D/H with increasing altitude in 

5 the first 15 km above the surface (Fig. 7). Water is transported from the troposphere 

6 to the stratosphere through direct convective injection and clouds [52,53], while 

7 additional water is created in the stratosphere by oxidation of CH4 and H2 by OH, Cl 

8 and O^D) [54], In a similar fashion, HDO is formed in the stratosphere from 

9 deuterated methane (CH3D). Because CH3D has shorter lifetime than CH 4 , the D/H in 

10 water increases with altitude above the tropopause (see Figure 7). Information on 

11 deuterium fractionation at higher altitudes was lacking until recently, when Sandor 

12 & Clancy [55] measured mesospheric HDO and H2 18 0 at radio wavelengths (200-226 

13 GHz), using the 12-m telescope at Kitt Peak (32°N, 112°W). Their measurements 

14 revealed extremely high variability in deuteration (6D ranged from -580 to -3 %o) in 

15 the lower mesosphere (50-70 km), which they associated with differential 

16 photolysis rates of HDO and H 2 O combined with atmospheric transport. 

17 These measurements demonstrate the importance of using realistic vertical profiles 

18 for H 2 O and HDO when synthesizing terrestrial spectra, especially in spectral 

19 regions where both isotopologues are radiatively active (e.g., in the NIR: 2v2 of H 2 O 

20 and vi of HDO sampled simultaneously with NIRSPEC). Accordingly, we compiled 6D 

21 data from several sources [52,55-60], and created a "standard" water fractionation 

22 profile (Figure 7). This vertical profile represents the "median" value measured at 


24 



1 each altitude. Considering the previously mentioned high variability of 6D, we 

2 expect this profile to change significantly with time and space. We integrated this 

3 profile into our layer-by-layer radiative transfer model. This is more realistic than 

4 simply adopting a constant value (VSMOW) at all altitudes above the tropopause, 

5 although, caution is still needed owing to the highly variable ratio of D/H in 

6 stratospheric and mesospheric water. On the other hand, the contribution to the 

7 total water column from the stratosphere and mesosphere is minimum. For instance 

8 if the water column above Mauna Kea (4,200 m) is 5 mm PWV, the contribution 

9 from above 14,000 m is only 4 pm PWV (less than 0.1%). Nevertheless, the 

10 parameterization of water above the tropopause is important when modeling 

11 certain low-energy lines that are stronger at lower temperatures, and when 

12 modeling the water spectra as measured from airborne/balloon observatories (e.g. 

13 SOFIA). 

14 Because LBLRTM does not permit a separation of the different isotopologues with 

15 altitude (layer-by-layer), we integrated the HDO vertical profile into the model by 

16 treating HDO as a distinct molecular species in the input spectral database and 

17 inside the LBLRTM routines. Apart from a realistic atmospheric description, a 

18 complete and accurate spectral database is essential for achieving correct results. 

19 The latest HITRAN version contains almost 3 million lines from 42 molecules, yet 

20 the database is still insufficient in some spectral regions. Ethane (C 2 H 6 ) provides a 

21 good example. Even though C 2 H 6 is present in the terrestrial atmosphere at only 

22 trace amounts (0.1-2 ppbv), the lines of its strongest band (V 7 ) are prominent in 

23 high-resolution atmospheric spectra. We thus extended the HITRAN database by 
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1 adding 5610 spectral lines of C 2 H 6 and 1780 lines of 4 bands of isotopic CO 2 

2 [42,46,61], 

3 We also explored performing fine corrections to the spectral parameters of the 

4 strongest lines of water present in H1TRAN. For this purpose, we used high 

5 signal-to-noise ratio data of Mars collected on 26 April 2010 using N1RSPEC at Keck 

6 11. As shown in Figure 8, after removing telluric, solar and Martian lines from our 

7 spectrum, the residuals (trace 'd') still show plenty of structure. The majority of the 

8 residual structure present in trace 'd' appears to be associated with incorrect 

9 modeling of telluric lines (see trace 'a'). 

10 In this spectral region, HITRAN2008 contains 213 lines of water, although only 24 

11 lines are needed to describe the main spectral morphology. These 24 lines have 

12 reported uncertainties of 10% for their tabulated intensities and of 0.01 cm 1 for 

13 their center frequencies (Rothman et al. 2009). Typical signal-to-noise ratios exceed 

14 500 for our Mars spectra (and sometimes approach 2000), so our spectral analyses 

15 require much higher precisions for intensities (of order 30 ppm) and line 

16 frequencies (0.0004 cm 1 ). Of course these precisions are "relative" within the 

17 spectral range, since we do not have an absolute spectral calibrator (aside from the 

18 solar lines). By performing a Levenberg-Marquardt optimization study on the line 

19 parameters, we retrieved "fine" corrections to the spectral parameters tabulated in 

20 H1TRAN, all within the reported uncertainties (see Table 3 and final residuals in 

21 trace 'f of Figure 8). This method is particularly advantageous for the analysis of our 

22 ground-based observations, since these corrections can be applied to all our 
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1 datasets, including those obtained with other instruments (e.g., CSHELL at 

2 NASA-IRTF, CRIRES at VLT). 

3 

4 4.2 Planetary transmittances and radiances: the Mars case 

5 Extrapolating the modeling presented in the previous section (section 4.1) to other 

6 planetary atmospheres is far from trivial. The layering, geometry, radiative transfer 

7 model, and the spectral lists need to be tailored to the completely different 

8 conditions present on other planets. In particular, the line broadening parameters 

9 contained in H1TRAN have been retrieved for typical pressures and temperatures 

10 present on Earth and its N 2 rich atmosphere. As demonstrated by laboratory 

11 experiments (e.g., [32,62]) these parameters are significantly different for a carbon 

12 dioxide atmosphere (e.g., Mars and Venus). 

13 Planetary scientists have used different methods to take this effect into account, 

14 some by scaling the N 2 -broadening parameters by a constant factor ( 1.3 : [63], 1.6: 

15 [64], 1.3 : [65], 1.5: [66], 1.7: [67]), while others (e.g., [68,69]) use the CO 2 -H 2 O line 

16 list presented by [70] for lines in the 200-900 cm 1 spectral range. The formalism in 

17 [70] considered only the real part of the Robert-Bonamy formulation, but it was 

18 recently found that the imaginary terms could change the value of the half-widths by 

19 as much as 25% and that the full complex calculations gave much better agreement 

20 with laboratory measurements [32,62], We therefore calculated half-widths and 

21 line-shifts for a CO 2 atmosphere (suitable for Mars and Venus) by applying the 

22 methodology presented in [62], with the experimental coefficients presented in 
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1 [32], assuming that the half-widths are not dependent on vibration state. This 

2 approximation should be reasonable for the half-width and its temperature 

3 dependence. 

4 We integrated this “adapted" spectral database into our Martian radiative transfer 

5 model (MRTM) based on the terrestrial LBLRTM code (see above]. The 

6 transformation of LBLRTM to synthesize Martian spectra was achieved by disabling 

7 the telluric layering scheme contained in the package and by setting up a 

8 "user-defined" scheme that considers the correct geometric observing conditions 

9 and the necessary corrections that take into account the sphericity of the planet. 

10 We consider 30 layers and a maximum altitude of 100 km for the layering scheme, 

11 with layer thickness increasing from 10 meters near the surface to 10 km for the 

12 upper atmosphere. Our approach follows a similar scheme used in [71] and that 

13 used for water retrievals from MGS/TES data [72] and MEX/PFS data [73]. The 

14 optical path of each layer was corrected to compensate for the curved atmosphere 

15 relative to the planetary radius (3396 km for Mars] following the methodology 

16 described in [74], Local atmospheric conditions and temperature profiles at each 

17 (surface] footprint were estimated using the Mars Climate Database (v4.3, [75]) for 

18 the appropriate solar longitude and Martian local time. Effects of dust and ice 

19 aerosols were not included in the radiative transfer. 

20 We explored the possibility of integrating the MGS-TES scattering model [66] into 

21 our atmospheric model, but the high degree of uncertainty in the input parameters 

22 (dust vertical profile, dust properties, dust abundances, ice clouds parameterization, 


28 



1 


etc.} prevented us from achieving reliable results. Fortunately, the broad spectral 

2 grasp of CRIRES (VLT] and NIRSPEC (Keck 11} allowed us to measure H 2 0 

3 simultaneously with a newly discovered band of C0 2 [61], permitting us to retrieve 

4 accurate mixing ratios for H 2 0 and other trace species at every footprint on Mars 

5 (with topographic corrections automatically included}. Results using this new model 

6 are presented in Figures 8 and 9 (see also Section 5.2). 

7 

8 5. Discussion 

9 5.1 Modeling of water fluorescence frton-LTEI: comparison to cometarv data 

10 We validated our water fluorescence model by comparing it to infrared data of 

11 comet C/2007 W1 (Boattini) acquired in July 2008 using the Near-lnfraRed Echelle 

12 SPECtrograph (NIRSPEC, [76]) at the Keck II Telescope in Hawaii. In addition to H 2 0 

13 and HDO, we sampled 11 other gases using three instrument settings (KL1, KL2, 

14 MWA). All settings sample water lines, and therefore mixing ratios of trace species 

15 (relative to water) are retrieved with high confidence. On July 9, we sampled HDO, 

16 CHsOH, C 2 H 6 and H 2 0 with the KL1 setting, and HCN, C 2 H 2 , NH 3 , CH 4 , C 2 H 6 , H 2 CO and 

17 H 2 0 with the KL2 setting. On July 10, we repeated the KL1 setting and then sampled 

18 the fundamental band of CO (5 lines, v = 1 - 0) along with H 2 0 (3 lines) in the MWA 

19 setting near 4.7 pm. The cross-dispersed frames were processed and calibrated 

20 following our standard methodology for analyzing two-dimensional (spectral and 

21 spatial) data [5], 
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1 From each spectral order, we extracted nucleus-centered spectra after binning 9 

2 spatial rows along the slit, together spanning ± 0.9 arc-sec or ± ~ 230 km from the 

3 nucleus. Rotational temperatures (T ro t) were then obtained for water by correlation 

4 and excitation analyses [77-79]. This process requires flux measurements for 

5 multiple lines spanning a broad range of excitation energies, along with line-by-line 

6 fluorescence models tailored for a range of temperatures. For the rotational 

7 analysis, we included lines measured in both L-band settings. KL1 samples lines of 

8 fairly low rotational excitation that usefully constrain the lower bounds for T ro t, 

9 while KL2 samples a number of lines of high rotational excitation thereby bounding 

10 the high temperature end of acceptable T ro t values. From the excitation analysis we 

11 also extract the abundance of each spin species, and the ortho-para ratio (see details 

12 in [7]). 

13 For each dataset (July 9 th and 10 th 2008), we computed fluorescence models using 

14 the cometary ephemerides at the time of the observations: Rh = 0.893 - 0.899 AU 

15 (heliocentric distance), Vh = 9.77 - 10.36 km s' 1 (heliocentric velocity of the comet), A 

16 = 0.348-0.356 AU (geocentric distance), vd = 12.91-12.98 km s 1 (geocentric 

17 velocity of the comet). The fluorescence models were then multiplied by telluric 

18 transmittances synthesized with the methodology presented in section 4.1. From 26 

19 water lines (10 para and 16 ortho) measured on 9 July, we retrieved a 

20 nucleus-centered rotational temperature of 80 ± 2 K and an OPR of 2.87 ± 0.15 (T sp in 

21 > 34K). Using data taken on both nights (9 and 10 July UT, with a total of 40 minutes 

22 of integration), we obtained an upper limit of 8.3 VSMOW (3-sigma) for D/H in 

23 water in Comet Boattini (see results for other comets in [5] and refs, therein). 
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1 The modeled and measured intensities for individual lines agree remarkably well, 

2 and the model successfully describes the general morphology of the water spectrum 

3 in both [L-band and M-band) settings (Figure 6). The frequency corrections applied 

4 to the BT2 energies (section 2.2) are of critical importance, as shown in the 'b' and 

5 'c' panels. With a high-resolution spectral sampling of 0.04 cm 4 /pixel, corrections 

6 up to 0.3 cm 4 (see Fig. 2) would represent significant shifts up to 7 pixels. These 

7 energy corrections, in addition to the use of a self-consistent spectral database, and 

8 the introduction of a realistic solar fluorescence pumping, have led to considerable 

9 improvements in the agreement between synthesized and measured line-by-line 

10 intensities. 

11 When compared to the retrievals with the previous water fluorescence model, the 

12 new model increases the precision of the retrieved production rate by a factor or 

13 ~2, and leads to an overall correction of ~15% to the previous values. For instance, 

14 with the current model we obtain QffhO) = 116.25 ± 9.47, while the previous model 

15 resulted in QfFbO) = 133.45 ± 20.15 (both in units of 10 26 molecules s 4 ). Note the 

16 difference in both the values of the retrieved production rates and in the ratio of 

17 production rate and error (the confidence limits are improved by about a factor of 

18 two by the new model, in this example). The main reason for the improvement in 

19 the latter is the improved agreement between predicted and observed line fluxes, 

20 with correspondingly smaller confidence limits in the retrieved production rate. For 

21 instance, consider the two lines originating from the 200-221 level (200-221 to 100- 

22 330 at 3394.1 cm 4 ; and 200-221 to 001-322 at 3372.8 cm 4 ); the notation is (m n2 n3 - 

23 |K a Kc). Previously these lines were systematically excluded from quantitative 
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1 analyses because older models predicted their intensities to be ~1.5 - 2 times 

2 weaker than observed (see Figure 2). A more accurate modeling of both the solar 

3 pump rate, spectral completeness and ro-vibrational branching ratios resulted in 

4 very good agreement with observations, and permitted their inclusion in our 

5 retrievals of rotational temperatures, ortho-para ratios, and production rates. 

6 

7 5.2 Modeling of water in planetary atmospheres fLTE): application to Mars data 

8 We validated our model for planetary atmospheres by analyzing NIRSPEC spectra of 

9 Mars. Measurements of water on Mars (with a temperature similar to Earth's) are 

10 only possible when Mars is at a high relative Doppler-shift (> 10 km s _1 ), and thus 

11 we scheduled our water observations in April 2010 when the relative velocity 

12 between Mars and Earth reached its maximum during this apparition. On 26 April 

13 2010, we oriented the NIRSPEC slit (0.144” x 24”) East-West over the center of the 

14 planetary disk, and sampled mid-latitudes north of Valles Marineris. 

15 The cross-dispersion capability of NIRSPEC allowed us to sample a wide range of 

16 frequencies across six spectral orders in the L-band. Order #21 of our KL1 setting 

17 sampled multiple lines of HDO (and CO 2 ) near 3.7 pm, while order #25 sampled 

18 multiple lines of H 2 O (and CO 2 ) near 3.1 pm (see Figs. 8 and 9). The simultaneous 

19 measurement of CO 2 with the isotopologues of water allowed us to retrieve 

20 high-confidence absolute mixing ratios for H 2 O and HDO relative to CO 2 , since all 

21 gases sampled the same mean topography (footprint) on Mars. The combination 

22 provided highly accurate D/H ratios for the sampled region on Mars. The fine 
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corrections applied to the spectral parameters allowed us to obtain high quality 

2 residuals for H 2 O and HDO, and ultimately for D/H in water (compare traces 'e' and 

3 'f of Figure 8, and traces 'c' and 'd' of Figure 9], In the presence of significant aerosol 

4 opacities (e.g., global dust storm], the retrieved D/H will be only of the column 

5 above the aerosol cloud. We can infer the existence of aerosols, by comparing our 

6 observed atmospheric columns (derived from the CO 2 measurements) with those 

7 predicted by the GCM. In general, we obtain very good agreement with the predicted 

8 pressures estimated by GCM models (e.g., [61,80]), implying a general small impact 

9 to the derivation of D/H by aerosols. 

10 Considering a spectral extract of data taken on 26 April from the center of the 

1 1 planetary disk (19°N and 55°W, 9:30AM local time on Mars, L s 82° of Mars Year 30), 

12 we retrieved a vertical column density (correcting for observing geometry) of 4.636 

13 ± 0.235 pr-pm for H 2 O, and of 10.127 ± 0.367 pr-nm for HDO, leading to a ratio of 7.0 

14 ± 0.4 VSMOW for D/H in water at this position on the planet. The retrievals were 

15 based on the residuals presented in Figs. 8 and 9, using spectral parameters of H 2 O 

16 and HDO adapted to Mars (CO 2 atmosphere) (see Section 4.2). Our localized 

17 deuterium enrichment factor (D/H = 7.0 ± 0.4 VSMOW) is larger than the full-disk 

18 results of [81] (5.0 ± 0.2 VSMOW) and [82] (5.8 ± 2.6 VSMOW). This difference is 

19 actually expected, since the full disk measurements also sample regions of low 

20 deuterium as observed by [83] and modeled by [84], Three-dimensional modeling 

21 of the D/H cycle on Mars predicts large depletions of deuterium at high-latitudes, 

22 mainly due to the formation of water ice clouds in which HDO is strongly enhanced 

23 [84], 
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This important deuterium enrichment of water on Mars is probably indicative of a 

2 significant loss of water, because of the preferential escape of the lighter form of 

3 water over geological times. How much water was lost and when this loss mainly 

4 occurred are (once again) topics of intense debate. The two main elements used to 

5 infer the loss of water over time (D/H ratios in present atmospheric water, and in 

6 ancient water from Martian meteorites) are based on highly disputed results. 

7 Previous studies of Martian meteorites have shown highly variable D/H and lower 

8 values (~2 VSMOW) than current atmospheric values (e.g., [85]). However, 

9 Greenwood et al. [86] reported a D/H of 4 VSMOW for the ancient ALH84001 

10 meteorite (4.5 — 3.9 Ga) and 5.6 VSMOW for the young shergottites (0.17 Ga), and 

11 suggested that the earlier measurements may have been biased by significant 

12 terrestrial contamination. Greenwood et al. ultimately concluded that Mars lost the 

13 majority of its water by 3.9 Ga. However, little is known about the current reservoirs 

14 of water on Mars and their D/H content. The fact that we observe strong 

15 geographical and seasonal variability of D/H on Mars [83,87] may indicate multiple 

16 water reservoirs of varying sizes, that gain and lose water to the atmosphere as 

17 functions of time [88]. 

18 6, Conclusions 

19 We developed a comprehensive methodology to measure water (H 2 O) and 

20 deuterated water (HDO) in diverse environments and under a wide range of 

21 excitation conditions. The models make use of the latest spectroscopic databases (of 

22 both empirical and theoretical origins) for H 2 O and HDO, and they incorporate 

23 corrections retrieved from high-resolution spectra obtained using NIRSPEC at Keck 
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1 II. Because most observations of water in the Universe are performed using 

2 ground-based telescopes, precise modeling of telluric transmittances is of crucial 

3 importance. The new model incorporates the strong vertical fractionation of 

4 deuterated water found in our own atmosphere, which was not considered in 

5 previous terrestrial radiative transfer models. 

6 We addressed this important issue by integrating our spectral compilation of water, 

7 CO 2 and C 2 H 6 lines into an advanced radiative transfer model of the terrestrial 

8 atmosphere (LBLRTM), which features a rigorous line-by-line, layer-by-layer 

9 radiative transfer analysis and includes realistic atmospheric conditions, abundance 

10 and isotopic profiles, and geometrical conditions. The terrestrial model was also 

11 extended for use on Mars by modifying the layering scheme and by calculating 

12 half-widths and line-shifts for a CO 2 atmosphere using the complex Robert-Bonamy 

13 formalism. 

14 Modeling of water in non-LTE environments was achieved by developing a full 

15 non-resonance fluorescence model with cascade (500 million H 2 O lines and 700 

16 million HDO lines). This model utilizes a novel approach to synthesize the solar 

17 pump and marries the comprehensiveness of the BT2 database with the spectral 

18 precision of semi-empirical databases (SELP/IUPAC/HITRAN/GEISA). Energy tables 

19 and other lists are available at http://astrobiology.gsfc.nasa.gov/Villanueva/spec.html. 

20 We successfully validated the methodology by retrieving water and D/H on Mars, in 

21 C/2007 W1 (Boattini) and on Earth, from data obtained using the high-resolution 

22 spectrograph NIRSPEC/Keck II atop Mauna Kea (Hawaii). The new model accurately 
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describes the complex morphology of the water bands and greatly increases the 

2 accuracy of the retrieved abundances (and the D/H ratio in water) with respect to 

3 previously available models. We expect that this newly developed methodology for 

4 retrieving H 2 O and HDO in planetary atmospheres will assist in unraveling the true 

5 history of water in our Solar System and beyond. 
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Table 1 . Summary of spectral lines and energy level information retrieved from the 
four principal repositories of spectroscopic parameters for water (H2O) * 



GEISA 

% 

HITRAN 

% 

BT2 

% 

HITEMP 

% 

Number of HaO lines 

41,447 

100.0 

37,432 

100.0 

505,806,255 

100.0 

114,209,395 

100.0 

Flagged symmetry (a) 

N/A 

N/A 

0 

0.0 

84,699 

0.0 

13,507 

0.0 

Flagged line ID (b) 

69 

0.2 

69 

0.2 

5,423 

0.0 

677 

0.0 

Unknown line ID (c) 

2,422 

5.9 

882 

2.4 

491,191,064 

97.1 

109,568,331 

95.9 

Number of levels 

8,072 

100.0 

8,762 

100.0 

221,097 

100.0 

28, 598(f) 

100.0 

Energy consistency (d) 









better than 0.3 cm 1 

7,841 

97.1 

8,674 

99.0 





better than 0.1 cm- 1 

7,792 

96.5 

8,670 

99.0 





better than 0.01 cm 1 

7,686 

95.2 

8,598 

98.1 





better than 0.001 cnr 1 

6,931 

85.9 

7,581 

86.5 





BT2 energy levels (e) 

corrected using GEISA 





7,044 

3.2 



corrected using HITRAN 





7,486 

3.4 



corrected using SELP(f) 





11,989 

5.4 




*The four repositories are: GEISA [31], H1TRAN [29], BT2 [20], HITEMP [25]. 

(a) Incorrect symmetries were determined by comparing the reported symmetry (statistical weight 
in HITRAN and HITEMP) and the corresponding symmetry derived from the reported level 
quanta (K a , K c and m). 

(b) Lines accessing levels with energies lower than the corresponding vibrational energy (i.e. 
negative rotational energies) were flagged as having incorrect quanta. 

(c) Lines where the upper or the lower level was not fully known were flagged as having unknown 
ID. 

(d) GEISA and HITRAN are repositories of line positions (not of energy levels), and thus different 
lines accessing the same line report slightly different energies. The consistency of the energy 
information contained in these terrestrial databases was tested by computing statistics 
(standard deviation and weighted mean) on the reported lower and upper energies of all lines. 

(e) The theoretically calculated energies of BT2 were corrected using semi-empirical information 
when the difference between the empirical and theoretical values was lower than 0.3 cm 1 . 

(f) The majority (55% = 15,606/28,598) of the H 2 O energy information of HITEMP is based on the 

semi-empirical line position (SELP) atlas that contains 15,606 energy levels. SELP is based on 
the database of experimental energy levels reported by Tennyson et al. [35], and includes recent 
updates obtained using high-temperature experiments (e.g., [36]). 
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Table 2. Summary of spectral lines and energy level information retrieved 
from the four principal repositories of spectroscopic parameters for 
deuterated water (HDO).* 



GEISA 

% 

HITRAN 

H1TEMP 

% 

VTT 

% 

Number of HDO lines 

11,980 

100.0 

13,238 

100.0 

697,450,825 

100.0 

Flagged symmetry (a) 

N/A 

N/A 

0 

0.0 

N/A 

N/A 

Flagged line ID (b) 

1 

0.0 

1 

0.0 

79,042,333 

11.3 

Unknown line ID (c) 

147 

1.2 

2 

0.0 

145,936,238 

20.9 

Number of levels 

2,050 

100.0 

3,350 

100.0 

163,491 

100.0 

Energy consistency (d) 







better than 0.3 cm 1 

2,043 

99.7 

3,350 

100.0 



better than 0.1 cm 1 

2,037 

99.4 

3,349 

100.0 



better than 0.01 cnr 1 

1,959 

95.6 

3,283 

98.0 



better than 0.001 cnr 1 

1,820 

88.8 

2,871 

85.7 



VTT energy levels (e] 

corrected using GEISA 





1,534 

0.9 

corrected using H1TRAN 





2,539 

1.6 

corrected using IUPAC 





5,287 

3.2 


* The three main repositories for HDO are: GE1SA, H1TRAN, VTT [21] and the IUPAC HDO 
survey [37], 

Labels (a), (b), (c), (d) and (e) are as described in Table 1. 
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Table 3. Corrections applied to the HITRAN database, based on telluric 
transmittances observed with N1RSPEC at Keck II (Mauna Kea, Hawaii).* 


Upper 
v, v 2 
V3 

Upper 

JKa Kc 

Lower 
J Ka Kc 

Line Strength 
Sold S new 

Diff 

(%) 

o 

(%) 

VOLD 

(cm 1 ) 

VNEW 

( cm - 1 ) 

Diff 

( cm 1 ) 

cr 

( cm 1 ) 

020 

3 

2 

1 

3 

1 

2 

124.70 

123.60 

- 0.874 

0.002 

3219.3835 

3219.3828 

- 0.0008 

0.0002 

020 

4 

2 

2 

4 

1 

3 

35.90 

36.10 

0.569 

0.003 

3220.4421 

3220.4430 

0,0009 

0.0002 

020 

2 

2 

0 

2 

1 

1 

31.14 

31.88 

2.365 

0.002 

3222.0347 

3222.0371 

0.0024 

0.0002 

020 

7 

2 

5 

8 

1 

8 

0.54 

0.48 

- 9.850 

0.019 

3223.3258 

3223.3260 

0.0003 

0.0018 

1 00 

7 

1 

6 

8 

4 

5 

4.00 

4.07 

1.709 

0.005 

3225.7061 

3225.7105 

0,0044 

0.0004 

1 00 

7 

0 

7 

8 

3 

6 

4.00 

4.01 

0.280 

0.005 

3226.0686 

3226.0719 

0.0033 

0.0004 

020 

5 

2 

3 

5 

1 

4 

67.37 

61.96 

- 8.026 

0.002 

3227.4646 

3227.4678 

0.0032 

0.0002 

020 

3 

i 

3 

2 

0 

2 

61.72 

62.85 

1.831 

0.002 

3229.9003 

3229.9021 

0.0018 

0.0002 

00 1 

8 

4 

4 

9 

6 

3 

3.28 

3.32 

1.086 

0.009 

3230.4202 

3230.4210 

0.0009 

0.0007 

I 00 

7 

7 

0 

8 

8 

i 

3.53 

3.63 

2.950 

0,010 

3230.9833 

3230.9849 

0.0016 

0.0005 

020 

4 

1 

3 

3 

2 

2 

10.65 

10,92 

2.510 

0.003 

3232,2735 

3232.2744 

0.0009 

0.0002 

020 

4 

0 

4 

3 

1 

3 

52.85 

53.02 

0.314 

0.002 

3233.0194 

3233.0174 

- 0.0020 

0.0002 

020 

2 

2 

1 

2 

1 

2 

71.41 

72.88 

2,065 

0,002 

3236.6489 

3236.6480 

- 0.0009 

0.0002 

020 

5 

1 

4 

5 

0 

5 

45.59 

42.92 

- 5.856 

0.002 

3240.1067 

3240.1054 

- 0.0013 

0.0002 

020 

6 

2 

4 

6 

1 

5 

10.89 

10.79 

- 0.950 

0,003 

3241.7733 

3241.7739 

0.0006 

0.0002 

! 00 

8 

2 

7 

9 

3 

6 

4.36 

4.31 

- 1.181 

0.005 

3243.0450 

3243.0464 

0.0014 

0.0005 

020 

5 

2 

3 

4 

3 

2 

7.14 

7.11 

- 0.431 

0.005 

3244.4053 

3244.4054 

0.0001 

0,0003 

020 

4 

1 

4 

3 

0 

3 

181.50 

184.30 

1.524 

0.003 

3244.9426 

3244.9428 

0.0002 

0.0005 

020 

3 

2 

2 

3 

1 

3 

26.79 

27.60 

3.014 

0.007 

3245.4021 

3245.4037 

0.0016 

0.0004 

00 1 

7 

4 

4 

8 

6 

3 

5.33 

5.31 

- 0.352 

0.005 

3247.3633 

3247.3638 

0.0005 

0.0004 

020 

5 

0 

5 

4 

1 

4 

145.60 

146.10 

0.375 

0.002 

3254.1481 

3254.1465 

- 0.0016 

0.0002 

1 00 

7 

6 

1 

8 

7 

2 

7.58 

7.29 

- 3.914 

0.006 

3256.0857 

3256.0886 

0.0029 

0.0003 

020 

4 

2 

3 

4 

1 

4 

63.34 

62.45 

- 1.401 

0.002 

3257.2261 

3257.2267 

0.0006 

0.0002 

1 00 

8 

5 

4 

9 

6 

3 

3.90 

3.87 

- 0.797 

0.006 

3258.0741 

3258.0752 

0.0011 

0.0005 


*AI1 lines originate from the ground-vibrational state, and have reported uncertainties of 10% 
(error code 5, see Rothman et al. 2005) for the intensities, and 0.01 cm 1 for the frequencies (error 
code 3). The line intensities (Sold, Smew) are given in units of 10 23 cnr ! /(molecule cm 2 ) at 296K. All 
required corrections are within la of the HITRAN reported uncertainties. See retrieval results in 
Figure 8. 
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Table 4. Water on Earth and Mars, and in comet C/2007 W1 (Boattini). 


Molecule 

Column abundance 

D Enrichment 
Factor, 

w.r.t. VSMOW O) 

Mars - Water column at 19°N and 55°W (9:30AM) at L s 82° (MY 30), 26 April 2010 

H 2 O (water) 

HDO (deuterated water) 

4.636 ± 0.235 pr-pm 
10.127 ± 0.367 pr-nm 

7.01 ±0.44 

Earth - Water column above Mauna Kea (Hawaii) at 4,145 m, 26 April 2010 09 

H 2 O (water) 

HDO (deuterated water) 

1.304 ± 0.070 pr-mm 
0.212 ± 0.017 pr-pm 

0.52 ±0.03 


Isomer / Production rate 

isotopologue (molecules s 1 ) 

Production rate 
(molecules s 1 ) 
and OPR 

D Enrichment 
Factor, 

w.r.t. VSMOW W 

Comet C/2007 W1 (Boattini), 9 July 2008, NIRSPEC at Keck II M 

H 2 0- Ortho (0.89 ± 0.02) xlO 28 

H 2 0 - Para (0.31 ± 0.01) x 10“ 

(1.20 ± 0.02) x 10 28 s 1 

OPR = 2.87 ±0.15 

< 8.3 (3a) 

HDO <2.73xlOZ5(3o) 



(a) The Vienna Standard Mean Ocean Water (VSMOW) is the standard defining the water isotopic 
composition of Earth's oceans - (D/H)vsmow = 155.76 ppm. 

(b) The reported uncertainties for the columns and isotopic ratios reflect the level of variability of 
the water columns during the observations on 26 April 2010 (05:30 - 08:42 UT) using N1RSPEC at 
Keck II atop Mauna Kea. Systematic uncertainties originating from the assumed Martian 
atmospheric conditions (temperature/pressure estimated by the MCD model) are smaller (<3%) 
than the intrinsic precision of the measurement. 

(c) Details on the chemical composition and retrievals for comet Boattini are presented in [79], 
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Figure 1. Comparison of the five principal spectroscopic databases of water (H 2 O; 
left) and of deuterated water (HDO; right). The upper panel shows normalized 
blackbody radiation curves for characteristic temperatures of Mars, Earth and the 
Sun. The middle panel shows integrated line intensities (cm V(molecule cm 2 )) for 
the four databases within 10 cm- 1 spectral bins at a temperature of 296K. The 
lowest panel shows number of lines within 10 cm 1 spectral bins. 
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2 

3 Figure 2. Comparison of the energy-level characterization of HITRAN, SELP, 

4 H1TEMP, BT2, VTT and HD0-1UPAC databases. The leftmost and rightmost panels 

5 show the level density from 0 to 30,000 cm 1 considering 300 cm 1 bins for H 2 O and 

6 HDO respectively. The middle panels show histograms of the difference in energy 

7 between BT2/VTT and semi-empirical values where the databases have matching 

8 ro-vibrational identifications (see text for details]. The theoretically calculated 

9 energies of BT2 and VTT were corrected used information from the semi-empirical 

10 databases when the difference was lower than 0.3 cnr 1 (dotted lines] - see Tables 1 

11 and 2. 
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Figure 3. Diagram showing full non-resonance fluorescence for H 2 O in a comet at 1 
Astronomical Unit (AU] and with a rotational temperature of 100 K. The pumping 
rates (shown in blue] were calculated considering a realistic Solar model, and the 
emission rates (shown in red/green/purple/yellow colors) were calculated by 
subsequent cascade down to the ground-vibrational level and considering 
line-by-line and level-by-level branching ratios which take into account all 500 
million transitions. Only pumps/emissions with vibrational rates higher than 10 7 s' 1 
are shown. 
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3 

4 Figure 4. Diagram showing full non-resonance fluorescence for deuterated water 

5 (HDO] in a comet at 1 Astronomical Unit (AU) and with a rotational temperature of 

6 100 K. Only pumps/emissions with rates higher than 10' 7 s- 1 are shown, and the 

7 rates are in photons per second per HDO molecule. 

8 
9 

10 


45 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 



(d) 


u x 0 
n _ 

6 ) jo 


: ■ 

If 


lUt 


j Jf ij y* h h* * -'p f 4* - * k- 




*r 


NEW 


4w 


Ortho | 
Para 


4 Previous 


3480 


3460 


3440 3420 

Frequency [cm 1 ] 


3400 


3380 


Figure 5. Diagram showing the main elements involved in the modeling of the water 
fluorescence emission, (a) Solar spectrum, which is a combination of a theoretical 
continuum model and a highly precise solar line list. The red trace shows the 
historically considered fluorescence pump for comets (a 5778 K blackbody), and the 
blue vertical traces the position of the main H 2 O pumps presented in Figure 3. (b) 
Atmospheric transmittance atop Mauna Kea, synthesized using LBLRTM [42,47], 
showing strong H 2 O and CO 2 absorptions at 3700 cnr 1 and 2350 cm 1 , (c) Stick 
spectrum of the newly developed water fluorescence spectrum, where the ‘red’ 
trace indicates fluorescence efficiencies observable only at the Top Of the 
Atmosphere (TOA), and ‘green’ trace indicates the observable flux with a telescope 
atop Mauna Kea (note the logarithmic ordinate scale), (d) A comparison of the new 
model and the previous model [38] fora rotational temperature of 100K. 
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